Collaborative Filter Pre-processing for Improved Corrupted Image Classification

نویسنده

  • Lucas Finn
چکیده

This paper investigates the effects of collaborative filtering on the classification of corrupted digit images. Several experiments were carried out using the MNIST digit dataset by applying a corruption model to the original images, two reconstruction algorithms, and an SVM classifier to measure classification accuracy at each stage of processing. The results demonstrate that collaborative filtering, when properly fit to the data, achieves higher accuracy than not filtering or using a Gaussian filter, and retains high accuracy even up to 85% image corruption. The experiments also show that classification performance drops precipitously when the collaborative filter is allowed to over-fit the training images during reconstruction. Introduction Collaborative filtering is widely known as the winning algorithm of the Netflix challenge for its ability to predict user preferences given highly sparse data [3]. More generally, it can be applied to matrix completion problems [1]. This paper investigates collaborative filtering as it relates to digit image classification on corrupted images using a support vector machine, with corruption simulating the sparse matrix completion problem. corrupt Original MNIST digit images SVM classifier Corrupted digit images SVM classifier Filtered digit images SVM classifier filter Figure 1 – Experimental setup for assessing SVM classifier accuracy on corrupted and reconstructed images. Figure 1 shows the experimental setup. The MNIST digit image dataset is taken as input, and contains 70,000 images of digits, each 28 2 pixels [8]. Each digit has been centered and normalized, but the dataset contains multiple handwriting styles; digits can be written multiple ways (e.g. “2” with or without a loop). Example digits are shown in Figure 2 (left). Using the raw images, an SVM was trained and tested, and the confusion matrix was computed as well as the mean digit classification accuracy. This provided a baseline accuracy for comparing other classifiers. A corruption algorithm was then applied to the MNIST images resulting in degraded image quality (Figure 2 right). The SVM classifier was run to compare the mean classification accuracy of digits. The final processing step applied two filtering algorithms to reconstruct the missing pixels, with the goal of improving SVM classifier accuracy. From these experiments, the effect of corruption and reconstruction was measured having isolated the individual algorithms. Figure 2 – (left) example digits from the MNIST dataset, (right) the same digits with 85% of the pixels removed, simulating sparse data. Note that some digits remain human-recognizable, but others do not. Methodology First, it is interesting to note that while the MNIST dataset appears to have a large number of features (one feature per pixel, 28 2 =784 total), these features are largely redundant. For instance, 8% of the pixels are always set to zero, as can be seen in the heat map of unique pixel values (Figure 3 left). Moreover, PCA indicates that 87 out of 784 features in the eigenbasis capture 90% of the variation in the dataset (Figure 3 right). Therefore, it is reasonable to

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

حذف نویز ضربه تصاویر با استفاده از فیلتر تطبیقی سوئیچ کننده مبتنی بر ماشین یادگیر بیشینه

In this paper a new efficient method for detecting the impulse noise from the corrupted image using extreme learning machine (ELM) is proposed. An improved version of the standard median filter is suggested to remove the detected noisy pixel. The performance of proposed detector is evaluated using classification accuracy. The results show that our detector is robust even at higher noise density...

متن کامل

Introduction to a simple yet effective Two-Dimensional Fuzzy Smoothing Filter

Annihilation or reduction of each kind of noise blended in correct data signals is a field that has attracted many researchers. It is a fact that fuzzy theory presents full capability in this field. Fuzzy filters are often strong in smoothing corrupted signals, whereas they have simple structures. In this paper, a new powerful yet simple fuzzy procedure is introduced for sharpness reduction in ...

متن کامل

Improved Adaptive Median Filter Algorithm for Removing Impulse Noise from Grayscale Images

Digital image is often degraded by many kinds of noise during the process of acquisition and transmission. To make subsequent processing more convenient, it is necessary to decrease the effect of noise. There are many kinds of noises in image, which mainly include salt and pepper noise and Gaussian noise. This paper focuses on median filters to remove the salt and pepper noise. After summarizin...

متن کامل

Development Hough transform to detect straight lines using pre-processing filter

Image recognition is one of the most important field in image processing that in recent decades had much attention .Due to expansion of related fields with image processing and various application of this science in machine vision, military science, geography, aerospace and artificial intelligence and lots of other aspects, out stand the importance of this subject.One of the most important aspe...

متن کامل

Impulsive Noise Elimination Considering the Bit Planes Information of the Image

Impulsive noise is one of the imposed defectives degrades the quality of images. Performance of many image processing applications directly depends on the quality of the input image. Hence, it is necessary to de-noise the degraded images without losing their valuable information such as edges. In this paper we propose a method to remove impulsive noise from color images without damaging the ima...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013